23 research outputs found

    Goal-driven, neurobiological-inspired convolutional neural network models of human spatial hearing

    The human brain effortlessly solves the complex computational task of sound localization using a mixture of spatial cues. How the brain performs this task in naturalistic listening environments (e.g. with reverberation) is not well understood. In the present paper, we build on the success of deep neural networks at solving complex and high-dimensional problems [1] to develop goal-driven, neurobiological-inspired convolutional neural network (CNN) models of human spatial hearing. After training, we visualize and quantify feature representations in intermediate layers to gain insights into the representational mechanisms underlying sound location encoding in CNNs. Our results show that neurobiological-inspired CNN models trained on real-life sounds spatialized with human binaural hearing characteristics can accurately predict sound location in the horizontal plane. CNN localization acuity across the azimuth resembles human sound localization acuity, but CNN models outperform human sound localization in the back. Training models with different objective functions - that is, minimizing either Euclidean or angular distance - modulates localization acuity in particular ways. Moreover, different implementations of binaural integration result in unique patterns of localization errors that resemble behavioral observations in humans. Finally, feature representations reveal a gradient of spatial selectivity across network layers, starting with broad spatial representations in early layers and progressing to sparse, highly selective spatial representations in deeper layers. In sum, our results show that neurobiological-inspired CNNs are a valid approach to modeling human spatial hearing. This work paves the way for future studies combining neural network models with empirical measurements of neural activity to unravel the complex computational mechanisms underlying neural sound location encoding in the human auditory pathway.
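
    The abstract gives no implementation details, but the overall approach it describes (a goal-driven CNN trained on two-channel binaural input with either a Euclidean or an angular objective) can be illustrated with a short, hedged sketch. The PyTorch code below is not the authors' model: the network name (BinauralLocalizerCNN), the input shape, the layer sizes and the exact form of both loss functions are assumptions made only to show the kind of architecture and objectives the abstract refers to.

        # Minimal sketch (not the authors' code) of a goal-driven CNN mapping a
        # two-channel (left/right ear) time-frequency input to a sound-source
        # azimuth. All shapes and layer sizes are illustrative assumptions.
        import torch
        import torch.nn as nn

        class BinauralLocalizerCNN(nn.Module):      # hypothetical name
            def __init__(self):
                super().__init__()
                self.features = nn.Sequential(
                    nn.Conv2d(2, 32, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
                    nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
                    nn.Conv2d(64, 128, kernel_size=3, padding=1), nn.ReLU(),
                    nn.AdaptiveAvgPool2d(1),
                )
                # Predict azimuth as a 2D unit vector (cos, sin) so the wrap-around
                # at +/-180 degrees does not create a discontinuity in the target.
                self.head = nn.Linear(128, 2)

            def forward(self, x):                   # x: (batch, 2, freq, time)
                v = self.head(self.features(x).flatten(1))
                return v / v.norm(dim=1, keepdim=True).clamp_min(1e-8)

        def euclidean_loss(pred_vec, target_azimuth_rad):
            # Euclidean distance between predicted and target direction vectors.
            target = torch.stack([torch.cos(target_azimuth_rad),
                                  torch.sin(target_azimuth_rad)], dim=1)
            return (pred_vec - target).norm(dim=1).mean()

        def angular_loss(pred_vec, target_azimuth_rad):
            # Angular distance (radians) between predicted and target directions.
            target = torch.stack([torch.cos(target_azimuth_rad),
                                  torch.sin(target_azimuth_rad)], dim=1)
            cos_sim = (pred_vec * target).sum(dim=1).clamp(-1.0, 1.0)
            return torch.acos(cos_sim).mean()

    Swapping euclidean_loss for angular_loss, or training copies of the model with different binaural integration layers, is the kind of manipulation the abstract links to distinct patterns of localization error.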

    BAST: Binaural Audio Spectrogram Transformer for Binaural Sound Localization

    Accurate sound localization in reverberant environments is essential for human auditory perception. Recently, Convolutional Neural Networks (CNNs) have been used to model the binaural human auditory pathway. However, CNNs have difficulty capturing global acoustic features. To address this issue, we propose a novel end-to-end Binaural Audio Spectrogram Transformer (BAST) model to predict the sound azimuth in both anechoic and reverberant environments. Two implementations are explored: BAST-SP and BAST-NSP, corresponding to the BAST model with shared and non-shared parameters, respectively. Our model with subtraction interaural integration and a hybrid loss achieves an angular distance of 1.29 degrees and a mean squared error of 1e-3 across all azimuths, significantly surpassing the CNN-based model. An exploratory analysis of BAST's performance on the left and right hemifields and in anechoic and reverberant environments shows its generalization ability as well as the feasibility of binaural Transformers for sound localization. Furthermore, an analysis of the attention maps is provided to give additional insight into the localization process in a natural reverberant environment.
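
    The released BAST code is not reproduced here; the sketch below only illustrates the ingredients named in the abstract (a per-ear spectrogram Transformer, subtraction interaural integration, shared versus non-shared parameters, and a hybrid loss). Patch size, embedding width, depth, the fusion point and the loss weighting are all assumptions.

        # Illustrative sketch, not the released BAST implementation. The design
        # follows the abstract's description; all hyperparameters are assumed.
        import torch
        import torch.nn as nn

        class SpectrogramEncoder(nn.Module):
            # Patch-embed one ear's spectrogram and run a small Transformer encoder.
            def __init__(self, dim=128, depth=4, heads=4, patch=16):
                super().__init__()
                self.patch_embed = nn.Conv2d(1, dim, kernel_size=patch, stride=patch)
                layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads, batch_first=True)
                self.encoder = nn.TransformerEncoder(layer, num_layers=depth)

            def forward(self, spec):                    # spec: (batch, 1, freq, time)
                tokens = self.patch_embed(spec).flatten(2).transpose(1, 2)
                return self.encoder(tokens).mean(dim=1)  # mean-pooled ear embedding

        class BASTLikeModel(nn.Module):                 # hypothetical name
            def __init__(self, shared=True, dim=128):
                super().__init__()
                self.left = SpectrogramEncoder(dim)
                # Shared parameters (BAST-SP-like) reuse one encoder for both ears;
                # non-shared (BAST-NSP-like) uses two independent encoders.
                self.right = self.left if shared else SpectrogramEncoder(dim)
                self.head = nn.Linear(dim, 2)           # direction as a unit vector

            def forward(self, left_spec, right_spec):
                # Subtraction interaural integration: difference of the ear embeddings.
                diff = self.left(left_spec) - self.right(right_spec)
                v = self.head(diff)
                return v / v.norm(dim=1, keepdim=True).clamp_min(1e-8)

        def hybrid_loss(pred_vec, target_vec, alpha=0.5):
            # Hedged stand-in for the hybrid loss: weighted sum of MSE and angular distance.
            mse = ((pred_vec - target_vec) ** 2).mean()
            ang = torch.acos((pred_vec * target_vec).sum(dim=1).clamp(-1.0, 1.0)).mean()
            return alpha * mse + (1 - alpha) * ang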

    Modality-specific brain representations during automatic processing of face, voice and body expressions

    A central question in affective science, and one that is relevant for its clinical applications, is how emotions conveyed by different stimuli are experienced and represented in the brain. According to the traditional view, emotional signals are recognized with the help of emotion concepts that are typically used in descriptions of mental states and emotional experiences, irrespective of the sensory modality. This perspective motivated the search for abstract representations of emotions in the brain, shared across variations in stimulus type (face, body, voice) and sensory origin (visual, auditory). On the other hand, emotion signals such as an aggressive gesture trigger rapid automatic behavioral responses, and this may take place before, or independently of, a full abstract representation of the emotion. This argues in favor of specific emotion signals that may trigger rapid adaptive behavior solely by mobilizing modality- and stimulus-specific brain representations, without relying on higher-order abstract emotion categories. To test this hypothesis, we presented participants with naturalistic dynamic emotion expressions of the face, the whole body, or the voice in a functional magnetic resonance imaging (fMRI) study. To focus on automatic emotion processing and sidestep explicit concept-based emotion recognition, participants performed an unrelated target detection task presented in a different sensory modality than the stimulus. Using multivariate analyses to assess neural activity patterns in response to the different stimulus types, we reveal a stimulus-category- and modality-specific brain organization of affective signals. Our findings are consistent with the notion that, under ecological conditions, emotion expressions of the face, body and voice may have different functional roles in triggering rapid adaptive behavior, even if, when viewed from an abstract conceptual vantage point, they may all exemplify the same emotion. This has implications for a neuroethologically grounded emotion research program that should start from detailed behavioral observations of how face, body, and voice expressions function in naturalistic contexts.
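
    The abstract mentions multivariate analyses of neural activity patterns without specifying the pipeline. As a hedged illustration of what such an analysis typically looks like, the sketch below runs a cross-validated linear classifier on simulated trial-by-voxel patterns; the data shapes, classifier and cross-validation scheme are generic assumptions, not the authors' actual analysis.

        # Generic multivariate pattern analysis (MVPA) sketch: decode stimulus
        # category (face / body / voice) from simulated voxel patterns. ROI size,
        # trial counts and classifier are assumptions for illustration only.
        import numpy as np
        from sklearn.svm import LinearSVC
        from sklearn.pipeline import make_pipeline
        from sklearn.preprocessing import StandardScaler
        from sklearn.model_selection import StratifiedKFold, cross_val_score

        rng = np.random.default_rng(0)
        n_trials, n_voxels = 120, 500                         # hypothetical ROI
        patterns = rng.standard_normal((n_trials, n_voxels))  # trial x voxel estimates
        labels = np.repeat(["face", "body", "voice"], n_trials // 3)

        decoder = make_pipeline(StandardScaler(), LinearSVC(C=1.0))
        cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
        accuracy = cross_val_score(decoder, patterns, labels, cv=cv).mean()
        print(f"cross-validated decoding accuracy: {accuracy:.2f} (chance is about 0.33)")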

    Imaging-based frequency mapping for cochlear implants – Evaluated using a daily randomized controlled trial

    Background: Due to variation in electrode design, insertion depth and cochlear morphology, patients with a cochlear implant (CI) often have to adapt to a substantial mismatch between the characteristic response frequencies of cochlear neurons and the stimulus frequencies assigned to electrode contacts. We introduce an imaging-based fitting intervention that aimed to reduce this frequency-to-place mismatch by aligning the frequency mapping with the tonotopic position of the electrodes. Results were evaluated in a novel trial set-up in which subjects crossed over between intervention and control using a daily within-patient randomized approach, starting immediately at the beginning of CI rehabilitation. Methods: Fourteen adult participants were included in this single-blinded, daily randomized clinical trial. Based on a fusion of pre-operative imaging and a post-operative cone beam CT (CBCT) scan, the mapping of electrical input was aligned to the natural place-pitch arrangement of the individual cochlea. That is, adjustments were made to the CI's frequency allocation table so that electrical stimulation of frequencies matched the corresponding acoustic locations in the cochlea as closely as possible. For a period of three months, starting at first fit, the blinded subjects crossed over between the experimental and standard fitting programs following a daily randomized wearing schedule, and thus effectively acted as their own controls. Speech outcomes (such as speech intelligibility in quiet and noise, sound quality and listening effort) were measured with both settings throughout the study period. Results: At the group level, standard fitting was preferred by the subjects and showed superior results on all outcome measures. In contrast, two out of fourteen subjects preferred the imaging-based fitting and correspondingly had better speech understanding with this setting than with standard fitting. Conclusion: On average, cochlear implant fitting based on individual tonotopy did not yield higher speech intelligibility, but the variability in individual results underscores the potential of individualized frequency fitting. The novel trial design proved to be a suitable method for evaluating experimental interventions with cochlear implants in a prospective trial set-up.
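
    The mismatch described in the Background can be made concrete with a small numerical sketch. The study derives electrode positions from CT imaging; in the code below the positions are invented, and the standard Greenwood place-frequency function is assumed as the tonotopic model (the authors' exact procedure may differ), purely to show how a per-electrode frequency-to-place mismatch could be quantified.

        # Illustrative estimate of frequency-to-place mismatch for a CI electrode array.
        # Electrode positions are hypothetical; the Greenwood (1990) map is assumed.
        import numpy as np

        def greenwood_frequency(rel_distance_from_apex):
            # Human place-frequency map: 0 = apex (low frequencies), 1 = base (high).
            A, a, k = 165.4, 2.1, 0.88
            return A * (10 ** (a * rel_distance_from_apex) - k)

        # Hypothetical positions of 12 contacts as a fraction of cochlear length
        # from the apex (deepest contact first), e.g. as estimated from a CBCT scan.
        electrode_pos = np.linspace(0.35, 0.75, 12)
        tonotopic_cf = greenwood_frequency(electrode_pos)    # Hz, place-based

        # Hypothetical default center frequencies of a standard frequency allocation table.
        default_cf = np.geomspace(250, 8000, 12)

        mismatch_oct = np.log2(tonotopic_cf / default_cf)
        for i, (f_place, f_map, d) in enumerate(zip(tonotopic_cf, default_cf, mismatch_oct), 1):
            print(f"electrode {i:2d}: place {f_place:7.0f} Hz, map {f_map:7.0f} Hz, "
                  f"mismatch {d:+.2f} octaves")

    An imaging-based fitting in the spirit of the intervention would then replace default_cf with tonotopic_cf (within the implant's technical limits), driving the per-electrode mismatch toward zero.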

    Cortical mechanisms of spatial hearing

    Humans and other animals use spatial hearing to rapidly localize events in the environment. However, neural encoding of sound location is a complex process involving the computation and integration of multiple spatial cues that are not represented directly in the sensory organ (the cochlea). Our understanding of these mechanisms has increased enormously in the past few years. Current research is focused on the contribution of animal models for understanding human spatial audition, the effects of behavioural demands on neural sound location encoding, the emergence of a cue-independent location representation in the auditory cortex, and the relationship between single-source and concurrent location encoding in complex auditory scenes. Furthermore, computational modelling seeks to unravel how neural representations of sound source locations are derived from the complex binaural waveforms of real-life sounds. In this article, we review and integrate the latest insights from neurophysiological, neuroimaging and computational modelling studies of mammalian spatial hearing. We propose that the cortical representation of sound location emerges from recurrent processing taking place in a dynamic, adaptive network of early (primary) and higher-order (posterior-dorsal and dorsolateral prefrontal) auditory regions. This cortical network accommodates changing behavioural requirements and is especially relevant for processing the location of real-life, complex sounds and complex auditory scenes.
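
    The review emphasizes that spatial cues must be computed from the binaural waveform rather than read off the cochlea directly. As a simple, hedged illustration (with made-up signal parameters), the sketch below estimates the two classic binaural cues: the interaural time difference (ITD) via cross-correlation and the interaural level difference (ILD) from the RMS ratio.

        # Toy estimate of ITD and ILD from a synthetic binaural signal. All signal
        # parameters (frequency, delay, attenuation) are made up for illustration.
        import numpy as np

        fs = 44_100                                 # sample rate (Hz)
        t = np.arange(0, 0.05, 1 / fs)              # 50 ms of signal
        true_itd = 300e-6                           # right ear lags by 300 us (source toward the left)
        left = np.sin(2 * np.pi * 500 * t)
        right = 0.7 * np.sin(2 * np.pi * 500 * (t - true_itd))   # delayed and attenuated

        # ITD: lag of the peak of the interaural cross-correlation.
        lags = np.arange(-len(t) + 1, len(t))
        xcorr = np.correlate(right, left, mode="full")
        itd_est = lags[np.argmax(xcorr)] / fs

        # ILD: interaural level difference in decibels (RMS-based).
        rms = lambda x: np.sqrt(np.mean(x ** 2))
        ild_est = 20 * np.log10(rms(left) / rms(right))

        print(f"estimated ITD: {itd_est * 1e6:.0f} us, estimated ILD: {ild_est:.1f} dB")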
